首页> 外文OA文献 >Study on Quality Improvement of HMM-Based Synthesized Voices Using Asymmetric Bilinear Model

【2h】

Study on Quality Improvement of HMM-Based Synthesized Voices Using Asymmetric Bilinear Model

机译：基于非对称双线性模型的基于HMM的合成语音质量改进研究

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
相似文献
相关主题

摘要

Hidden Markov model (HMM)-based synthesized voices are intelligible but not natural especially under limited-data conditions due to over-smoothed speech spectra. Improving naturalness is a critical problem of HMM-based speech synthesis. One solution is to use voice conversion techniques to convert over-smoothed spectra to natural spectra. Although conventional conversion methods transform speech spectra to natural ones to improve naturalness, they cause unexpected distortions in the intelligibility of synthesized speech. The aim of the study is to improve naturalness without reducing the intelligibility of synthesized speech by employing our novel asymmetric bilinear model (ABM) to separate the intelligibility and naturalness of synthesized speech. In the study, our ABM was implemented on the modulation spectrum domain of Mel-cepstral coefﬁcient (MCC) sequences to enhance the ﬁne structure of spectral parameter trajectory generated from HMMs. Subjective evaluations carried out on English data conﬁrmed that the achieved naturalness of the method using the ABM involving singular value decomposition (SVD) was competitive with other methods under large-data conditions and outperformed other methods under limited-data conditions. Moreover, modiﬁed rhyme test (MRT) showed that the intelligibility of synthesized speech was well preserved with our method.

机译：基于隐马尔可夫模型（HMM）的合成语音是可理解的，但不是自然的，尤其是在由于语音频谱过度平滑而在有限数据条件下。改善自然性是基于HMM的语音合成的关键问题。一种解决方案是使用语音转换技术，将过度平滑的频谱转换为自然频谱。尽管常规转换方法将语音频谱转换为自然频谱以提高自然度，但它们会导致合成语音的清晰度出现意料之外的失真。该研究的目的是通过使用我们的新型非对称双线性模型（ABM）来分离合成语音的清晰度和自然度，从而在不降低合成语音的清晰度的情况下提高自然度。在这项研究中，我们的ABM在Mel倒谱系数（MCC）序列的调制频谱域上实施，以增强HMM产生的频谱参数轨迹的精细结构。对英语数据进行的主观评估证实，使用大数据条件下的包含奇异值分解（SVD）的ABM方法所获得的自然性与其他方法相比具有竞争优势，而在有限数据条件下则优于其他方法。此外，改进的韵律测试（MRT）表明，使用我们的方法可以很好地保留合成语音的清晰度。

著录项

作者
Dinh-Anh, Tuan; Morikawa, Daisuke; Akagi Masato;
展开▼
作者单位

展开▼
年度 2016
总页数
原文格式 PDF
正文语种 en
中图分类

相似文献

外文文献
中文文献
专利

1. Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization [J] . Anh-Tuan DINH, Masato AKAGI 電子情報通信学会技術研究報告. 信号処理. Signal Processing . 2015,第522期

机译：使用非负矩阵分解的不对称双线性模型基于自然和清晰度分解的基于HMM的合成语音质量改进
2. Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization [J] . Anh-Tuan DINH, Masato AKAGI 電子情報通信学会技術研究報告. 音声. Speech . 2015,第523期

机译：使用非负矩阵分解的不对称双线性模型基于自然和清晰度分解的基于HMM的合成语音质量改进
3. Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using asymmetric bilinear model with non-negative matrix factorization [J] . Anh-Tuan DINH, Masato AKAGI 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2015,第521期

机译：使用非负矩阵分解的不对称双线性模型基于自然和清晰度分解的基于HMM的合成语音质量改进
4. Quality improvement of HMM-based synthesized speech based on decomposition of naturalness and intelligibility using non-negative matrix factorization [C] . Anh-Tuan Dinh, Masato Akagi 2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Technique . 2016

机译：基于非负矩阵分解的自然性和清晰度的基于HMM的合成语音质量改进
5. Quality modeling and improvement of university facilities services using Six-Sigma – A case study on Wayne State University FPM services [D] . Isa, Mohsen Farag Mohamed 2013

机译：使用6西格玛（Six-Sigma）进行大学设施服务质量建模和改进-以韦恩州立大学FPM服务为例
6. Improving STI and HIV Passive Partner Notification using the Model for Improvement: A Quality Improvement Study in Lilongwe Malawi [O] . MM Matoga, MC Hosseinipour, E Jere, -1

机译：使用改善模型改善性传播感染和艾滋病毒/艾滋病被动伙伴通知：马拉维利隆圭的一项质量改善研究
7. A Study of Bilinear Models in Voice Conversion [O] . Popa, Victor, Nurminen, Jani, Gabbouj, Moncef 2011

机译：语音转换中的双线性模型研究

Study on Quality Improvement of HMM-Based Synthesized Voices Using Asymmetric Bilinear Model

摘要

著录项

相似文献

相关主题

期刊订阅